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ABSTRACT 

We present an analysis of the clustering of galaxies as a function of their stellar mass at 1 < z < 2 
using data from the NEWFIRM Medium Band Survey (NMBS). The precise photometric redshifts 
and stellar masses that the NMBS produces allows us to define a series of stellar mass limited samples 
of galaxies more massive than 7 x IO^Mq, 1 x lO^^M© and 3 x lO^^M© in three redshift intervals 
centered on z = 1.1, 1.5 and 1.9 respectively. In each redshift interval we show that there exists a 
strong dependence of clustering strength on the stellar mass limit of the sample, with more massive 
galaxies showing a higher clustering amplitude on all scales. We further interpret our clustering 
measurements in the ACDM cosmological context using the halo model of galaxy clustering. We 
show that the typical halo mass of both central and satellite galaxies increases with stellar mass, 
whereas the satellite fraction decreases with stellar mass, qualitatively the same as is seen at z < 1. 
We see little evidence of any redshift dependence in the relationship between stellar mass and halo 
mass over our narrow redshift range. However, when we compare our measurements with similar ones 
at ~ 0, we see clear evidence for a change in this relation. If we assume a universal baryon fraction, 
the ratio of stellar mass to halo mass reveals the fraction of baryons that have been converted to stars. 
We see that the peak in this star formation efficiency for central galaxies shifts to higher halo masses 
at higher redshift, moving from ~ 7 x 10^^/i~^Mq at z ~ to ~ 3 x 10^^h~^ Mq at z ~ 1.5, revealing 
evidence of 'halo downsizing'. Finally we show that for highly biased galaxy populations at z > 1 
there may be a discrepancy between the space density and clustering predicted by the halo model 
and the measured clustering and space density. This could imply that there is a problem with one or 
more ingredient of the halo model at these redshifts, for instance the halo bias relation may not yet 
be precisely calibrated at high halo masses or galaxies may not be distributed within halos following 
an NEW profile. 

Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation — galaxies: 
halos — large-scale structure of universe 
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1. INTRODUCTION 

Understanding the formation and evolution of galaxies 
in a cosmological context remains one of the most chal- 
lenging problems in modern astrophysics. In the current 
cosmological framework, where the mass in the universe 
is dominated by cold dark matter (DM) , luminous galax- 
ies form at the centers of dark matt er halos via the cool- 
ing and condensation of baryons (I White fc ReesI 119781 : 
iFall fc Efstathioul [19801 : iBlumenthal et al.l 119841 ). This 
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means that the properties of galaxies are directly cou- 
pled to those of the dark matter halos in which they live. 

If we wish to understand galaxy formation within this 
context, it becomes important to try to link the observed 
properties of galaxies, such as stellar mass or color, to the 
mass of the halos hosting galaxies with those observed 
properties, to better understand the physical processes 
involved. Making such a direct link can be achieved rel- 
atively easily in massive clusters of galaxies, with X-ray, 
Sunyaev-Zel'dovich effect and strong and weak lensing 
measurements, but it is much more challenging for less 
massive h alos. Dynamical m easurements of bound satel- 
lites (e.g. iMore et aT 20101, or strong (e.g. lAtiger et all 
|2010[) and weak (e.g. Mand elbaum et all |2006[ ) gravita- 
tional lensing, while effective techniques, are observation- 
ally expensive and have thus mainly been used for galax- 
ies in the local universe, with only a few studies ti p to 
z ~ 1 (e.g. lHevmans et al.l [20061 : iConrov et al.ll2007D . 

Measuring the spatial clustering of galaxies provides 
an alternative approach to relating galaxy properties to 
those of the DM distribution. More clustered popula- 
tions must occupy regions of higher dark matter den- 
sity ( i.e. more massive dark matter halos), than less 
clustered populations. The desire to determine the link 
between galaxies and dark matter halos from clustering 
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measurements has led to the development o f the Halo 
Occu p ation Distribution (HOD) framewo r k (iJing et al. | 
199^; 'Ma fc Frvl [2000l: 'Peacock fc SmithI 1200 0^ 'Selial^ 
2000; Scoccima rro et al J 2001; Bcrhn d fc Wein berg 2002- 
Coorav fc Shcth 2002\ The HOD characterizes the sta- 



tistical relationship between galaxies and dark matter 
halos by describing the probability that a halo of a given 
mass hosts a certain number of galaxies with a given 
property. 

The recent completion of large redshift surveys 
in the local univer se such as the Sloan Digital 
Sky Survey (SDSS; lYork et alJ l2000h a nd the Two 
Degree Field Galaxy Redshift survey (jColless et alJ 
1200 ID have allowed precise measurements of the clus- 
tering of galaxies as a function of their intrinsic 
properties, such as lumin osity, color, star f orma- 
tion rate, and mo rphology (Norberg et al.' '2001, 20021 
Zehavi e t aIll2002l:rB udavari et al. 20 03: Madgwick et al 

^006; Swanson et al 



2003"; Zeh avi et al.l 2005; L i et al. _ _ , 

20081; |Ross fc Brunner„2009l; iLoh et~al.,,2010, : ,Ross et al . 
20ld IZehavi et al.ll2010D . This has led to an established 



observational picture with galaxies becoming more clus- 
tered on all scales as their luminosity or stellar mass in- 
creases. As the color becomes redder, or the star for- 
mation rate decreases, the clustering strength again in- 
creases where the magnitude of the increase becomes 
larger on small scales. 

These relationships between galaxy properties and 
clustering strength can straightforwardly be interpreted 
in the framework of the HOD (or cl osely related Con - 
ditional Luminosity Function, CLF; I Yang et all I2003D . 
Such analyses reveal an increase in the typical mass of 
the host halos as the galaxy stellar mass increases, and 
that the distribution of satellite galaxies in massive halos 
is a strong function o f their co lor (or star formation rate; 
Yan et alj _i2003: Yan g et al.l 120 05: Zehavi et al. 2005; 



Zheng et al.i i2007i; iRoss fc Brunner .2009: .Zehavi et al.l 
201011 . Such constraints on the relationship between 



galaxy properties and those of dark matter halos pro- 
vide both insight into the physics of galaxy formation 
and particularly strong tests of any cosmological galaxy 
formation model. 

Observations of galaxy clustering up to a redshift of 
one, from both large spectroscopic and photometric red- 
shift surveys, appear to show similar trends as those 
observed in the local universe. More massive/luminous 
galaxies show stronger clustering and are thus associated 
with more massive halos, and the relat ionships between - 



color and clustering seems t o persist (iCoil et al 
Le Fevre et alll2005t iPhleps et al. 20 0l lCoil et al I 
" ' ""2006; 'Coil et al. 2008; 



2004 



2ooe 



200s 



Polio et al.' '2006; Coil et al. 2008; Meneux et al.l 
McC rackcn et al. 2008; Meneux ct al. 2009; Simon et al.l 
20"09HAbbas et al.ll201Q) . Again the HOD has been effec- 



tively usedjtojnterore these measuremen t s (Yan et al 



200a IPhleps et all[200l iZheng et al.ll2007l: lAbbas et al 



2OIOD, and perhaps even more importantly has allowed 



measurements at several epochs to be combined with 
the evolution of the halo properties to understand the 
evolution of galaxy pro perties i n a cosmo logical context 



jVan ct al. 2003; Conr ov et al l [2 006: Zh eng et al 



White et al..,2007; Wake et al.ll2008ai iBrown et al 



2007 



2008 



Conrov fc Wechsleril2009aHAbbas et al.ll2010i) 

At z > 1 the picture becomes less clear, mainly as a 
result of the difficulty in constructing complete volume 



limited samples of galaxies at these early epochs. The 
most precise clustering measurements have come from 
samples of Lyman Beak Galaxies (LBGs) . These galax- 
ies show strong clustering strengths whi ch depends on 
their luminosity ("Adelberger et al."2005a''b'; Ouchi ct alj 
20051 iLee et al.. 200 6. ,2009; Hildcbrandt et al., 200i 



Bielbv et al. However, these samples comprise of 



relatively blue, unobscured, star-forming galaxies and do 
not represent a complete sample. In particular, the LBG 
selection misses the most massive galaxies, which tend 
to be red and faint in the optical an d require deep near- 
infrar ed imaging for their selection (jvan Dokkum et al.l 
l2006h . 

Several studies of the clustering of z > 1 mas- 
sive galaxies selected using a variety of optical/near- 
infrared color selection techniques have been under- 
taken: Extremely Red O b,iects (E ROs; Daddi et al. | 
Roche et all [2002: Brown et al. 2005; Kong et al.l 
20091: iKim et al l I2010D. BzK s (Kong ct al. 200£ 



200c ; 



2006, 



Havashi et al.l l2007t iBlanc et al.l l2008t iHartlev et atl 



2010D. and Distant Red Galax- 



2008; McCracken et al, 

ies (DRGs iG razian et al.' '2006'; 'Fouca ud et all 120071 : 
lOuadri et aLf l2'007. 2008; Kim ct al. 2010). These stud- 
ies revealed strong clustering and some limited evidence 
for a luminosity and color dependence. However, due to 
the relatively poor quality of the photometric redshifts 
of these samples and the effect the color selection has in 
limiting the range of galaxy types selected, it has been 
difficult to draw any strong conclusions regarding the 
relationship between luminosity or stellar mass to halo 
mass at these redshifts. 

This situation is beginning to change with the advent 
of wide-field near-infrared cameras, which have enabled 
the construction of wide and deep near infra-red selected 
galaxy samples at z > 1. Whilst it is still almost im- 
possible to generate complete spectroscopic samples of 
galaxies at these redshifts, it has been possible to com- 
bine multiple near-IR bands with deep optical imaging 
to produce reasonable photometric re dshifts and stel- 
lar mass estimates. For example, Fouca ud et al.l ()2010l ) 
combine near-IR imaging from the Palomar Observatory 
Wide-Field Infrared Survey with optical imaging from 
the CFHT to define galaxy samples selected by redshift 
and stellar mass at z < 2, based on photometric red- 
shifts accurate to 6z/{l + z) ~ 0.07. They then use these 
samples to measure the stellar mass dependent cluster- 
ing and by using a simple halo model relate, the galaxy 
stellar mass to the dark matter halo mass. 

In this work we make similar measurements us- 
ing the NEWFIRM in edium band survey (NMBS; 
Ivan Dokkum et al.l [20091) . The NMBS combines deep 
near-IR imaging through five medium band filters, with 
multiple deep optical, ultra-violet and IR band imag- 
ing to produce precise iSz/{l + z) < 0.02) photometric 
redshifts and stellar mass estimates. We use these data 
to measure the clustering as a function of stellar mass 
for complete stellar mass limited samples with masses 
> 7 X IO^Mq and 1 < z < 2. We then use the latest halo 
modeling techniques to relate the stellar mass of galaxies 
to the mass of the halos in which they reside. 

In Section [5] we describe the NMBS. In Section [3] we 
describe how we define the stellar mass limited samples 
and the calculation of the correlation function. In Section 
ID we present our measurements of the clustering as a 
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function of stellar mass. We describe the halo model in 
Section [5] and the resulting relationships between stellar 
mass and halo mass in Section [6] and summarize and 
conclude in Section [T] 

Throughout this paper, we assume a flat A-dominated 
CDM cosmology with flm = 0.27, Hq = 73km 
s~^Mpc~^, and as = 0.8 unless otherwise stated. 

2. DATA 

2.1. The NEWFIRM Medium Band Survey 

The galaxy samples are selected from the NMBS, a 
moderately wide, moderately deep near-infrared imag- 
ing survey (|van Dokkum et"alll2009[ ). The survey used 
the NEWFIRM camera on the Kitt Peak 4m tele- 
scope. The camera images a 28' x 28' field with four 
arrays. The gaps between the arrays are relatively small, 
making the camera very effective for deep imaging of 
0.25 deg^ fields. We developed a custom filter sys- 
tem for NEWFIRM, comprised of five medium band- 
width filters i n the wavelength r a nge 1 /im - 1.7 ^m. 
As shown in Ivan Dokkum etlUI (|2009f ). these fihers 
pinpoint the Balmer and 4000 A breaks of galaxies 
at 1.5 < z < 3.5, providing accurate photometric 
redshifts and improved stellar population parameters. 
The survey targeted two 2 8' x 28' fields: a subsec- 
tion of the COSMOS field (jScoviUe et al.l [2007h and a 
field containing part of the AEGIS strip (jDavis et al.l 
l2007fl. Coordinates a nd ot her information are given 



van Dokkum et all ()2009D . Both fields have excel- 



lent supporting data, including ultraviolet (GALEX), ex- 
tremely deep optical ugriz (CFHT Legacy Surve^CT) and 
deep mid-IR (Spitzer IR AC and MIPS (jBarmbv et al.l 
[2006': 'San ders et al.|[2007l) ) imaging. The reduced CFHT 
mosaics we re kindl y provided to us by the CARS team 
(jErben et al. 2009: iHildebrandt et alTl2009h . Addition- 
ally, the COSMOS fields includes deep Subaru BVriz 
and 12 optical medium-band imag et0. The NMBS adds 
six filters: Ji, J2, J3, Hi, H2, and K. Filter char- 
acteristics of the five me dium band filters are given in 
Ivan Dokkum et all (I2009D. a nd the AB zero points can 
be found in lWhitaker etahl (|2010bf l. 

The data reduction, an alysis, and properties o f the 
catalogs are described in iWhitaker et al.l ()2010bl ). In 
the present study we use a if-selected c atalog based on 
the full NMBS data set (as described in IWhitaker et al.l 
[2010b) . AU optical and near-IR images were convolved 
to the same point-spread function (PSF) before mea- 
suring aperture photometry. Following previous studies 
(|Labbe et al.l 120031 : lOuadri et all 120071 ) photometry was 
performed in relatively small "color" apertures to op- 
timize the S/N ratio. Total magnitudes in each band 
were determin ed from the S Extractor AUTO aperture 
flux (Berlin fc ArnoutsI 1199(f ). with an additional aper- 
ture correction computed from the K band growth curve. 
The aperture correction is a point-source based correc- 
tion that accounts for flux outside of the AUTO aperture. 
We note that about 10% of the objects detected by SEx- 
tractor are classified as single objects but are actually 
blended. We use a deblended cat alog here and refer the 
reader to IWhitaker et al.l (|2010b[ ) for the details of the 

http:/ /www. cfht.hawaii.edu/Science/CFHLS/ 
http://irsa.ipac.caltech.edu/data/COSMOS/images/ 



deblending algorithm employed. 

Pho tometric redshifts were determined with the EAZY 
code (jBrammer et al.ll2008D . using the fufl NUV-8/Ltm 
spectral energy distributions (SEDs) (NUV-i^ for ob- 
jects in the ~ 50% of our AEGIS field that does 
not have Spitzer coverage). Publicly available redshifts 
in the COSMOS and AEGIS fields indicate that the 
reds hift errors are very small at az/(l + z) < 0.02 
(see lBrammer et al.|[2009HWhitaker et al.ll2010al [bl). Al- 
though there are very few spectroscopic redshifts of 
optically-faint iiT-selected galaxies in these fields, we note 
that we found a similarly sm all scatter in a. pilot program 
targeting galaxies from t he iKriek et al.l (I2008D near-IR 
spectroscopic sample fsee Ivan Dokkum et al. Ii009). 

Each galaxy within the survey is assigned a weight in 
each of the NMBS bands based on the fraction of the 
maximum exposure present at the galaxies position. In 
order to ensure both a minimum and even S/N coverage 
for our samples, we only use galaxies in areas that have 
a minimum weight (wmin) > 0.3, i.e. the least exposed 
optical to near IR band has at least 30% of the exposure 
time as the most exposed part of the image. The regions 
around bright stars are also excluded from our analy- 
sis, as faint galaxies in these regions are either obscured 
by the foreground star or have systematically incorrect 
magnitudes due to scattered light. After the removal of 
these regions the total remaining area is 0.201 deg^ in 
the AEGIS field and 0.189 deg^ in COSMOS. 

Stellar masses and other stel lar population par ameters 
were determi ned with FAST (IKriek et all [2001) using 
the models of lMarastonI (j2005t) . the lCalzetti et all ()2000D 
reddening law, and exponentially declining star forma- 
tion hi stories. Masse s and star formation rates are based 
on a Krouoa (2001) initial mass function (IMF); fol- 
lowing Branimer ct al. (2008) rest-frame near-IR wave- 
lengths are down- weighted in the fit as their interpre - 
tation is uncertain (see, e.g .. Ivan der Wei etaP [20061) . 
More details are provided in iBrammer et al.l (|2009[ ). 

There exist significant systematic uncertainties in the 
determination of galaxy stellar masses based on uncer- 
tainties in the IMF, stellar population synthesis model, 
extinction law and star formation history which can re- 
sult in unc ertainties in the mass of a factor of a few 
(e.g. see 'Marchesini etalj 120091: iMuzzin et all 120091: 
iConrov k W echsler 2009i). In this work our mam in- 
terest is in determining the dependence of the clustering 
amplitude on stellar mass and then characterizing that 
dependence in terms of halo mass. What we are really in- 
terested in is not the absolute determination of the stellar 
mass but the rank order, as we wish to find all galaxies 
above some stellar mass limit. Whilst the systematics 
may cause some scatter in this order it is much less than 
the systematic error on the overall normalization. 

3. THE 2PT-C0RRELATI0N FUNCTION 

3.1. Stellar Mass Limited Samples 

The aim of this paper is to investigate how the clus- 
tering of galaxies at 1 < z < 2 depends on their stellar 
masses, and, with the use of the halo model, determine 
the relationship between stellar mass and halo mass. The 
simplest, most robust and systematic free approach is 
to define volume limited samples, with a variable stellar 
mass limit but constant volume. With such samples we 
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Fig. 1. — Example of redshift distributions in the three redshift intervals calculated by summing the photometric redshift PDFs 
of all galaxies with each galaxy PDF weighted by the fraction of the probability lying within the redshift interval in question 
(see text for details). For each redshift interval we show the redshift distribution for lowest stellar mass limited sample (solid 
lines) and for stellar masses > 5 x IO^^Mq (dashed lines). Large scale structures are clearly visible in the redshift distributions 
reflecting the accuracy of the photometric redshifts. 



may directly compare the angular correlation functions 
of different stellar mass limited samples, and make use 
of the simplest implementation of the halo model. 

In order to study redshift evolution we first define three 
galaxy samples with slightly overlapping redshift inter- 
vals, 0.9 < z < 1.3, 1.2 < z < 1.75 and 1.6 < z < 2.2. 
Within each redshift range we define several mass limited 
samples where the lowest stellar mass limit is defined by 
the stellar mass limit of the survey at the highest red- 
shift in the redshift bin. The highest stellar mass limit is 
defined so that there are sufficient galaxies to make a rea- 
sonable measurement of the correlation function. These 
samples are described in Table [1] 

We estimate the stellar mass completeness limit at a 
given redshift as follows. We rank order all of our galaxies 
by redshift. At the redshift of interest we select the next 
1000 galaxies of lower redshift. For those 1000 galax- 
ies we find the 90th percentile in the K-band mass-to- 
light ratio. Finally the stellar mass completeness is de- 
termined as the mass a galaxy would have with this 90th 
percentile mass-to-light ratio at the 99 % K-band com- 
pleten ess limit of the survey (K = 22.8; IWhitaker et al.l 
l2010bD . For example, for our 0.9 < z < 1.3 sample we 
wish to estimate the completeness at z = 1.3 and so se- 
lect the 1000 galaxies lying immediately below that red- 
shift which corresponds to the range 1.258 < z < 1.3. 
We then find the 90th percentile of the K-band mass- 
to-light ratio of these 1000 galaxies. At K — 22.8, the 
completeness limit of our survey, this corresponds to a 
stellar mass of 7.08 x 1O^M0. Since the galaxies we use 
in this calculation are at lower redshift than our target 
(z = 1.3) they will be more complete in stellar mass than 
the actual galaxies at z = 1.3 yet are sufficiently close in 
redshift to be representative of those galaxies. 

To determine which galaxies are included in any given 
redshift interval we make use of the full redshift probabil- 



ity distribution functions (PDFs) output by the EAZY 
photometric redshift code. All galaxies that have any 
non-zero part of their PDF within the given redshift in- 
terval are included in our samples. However, each is 
given a weight equal to the fraction of the total prob- 
ability that is within the interval. This has the effect of 
down weighting galaxies with uncertain redshifts that lie 
close to the edge of the desired redshift interval, or those 
with multiple peaks, where one peak is within and one or 
more is outside. We use these weights throughout when 
calculating redshift distributions, space densities and the 
angular correlation function. Table [T] lists the total num- 
ber of galaxies contributing to each sample {Ncai), the 
sum of the weights of these galaxies (A'^d), which gives 
the effective number of 'true' galaxies in the sample, and 
the mean redshift of the weighted sample. 

Figure [1] shows the redshift distributions of two sam- 
ples with different stellar mass limits from each redshift 
interval, calculated using the PDF weights. Whilst there 
are many galaxies with some probability outside of the 
redshift intervals the effect of the weight combined with 
the accuracy of the photometric redshifts is to cause a 
relatively sharp transition. Large scale structures are 
clearly visible in the redshift distributions reflecting the 
accuracy of the photometric redshifts. The same struc- 
ture is repeated for all stellar mass limits. 

The 2pt-correlation function is a straight forward way 
to measure the spatial clustering of our galaxy samples, 
and when combined with the space density can produce 
very strong constraints on the distribution of galaxies 
within dark matter halos. Since we do not have suffi- 
ciently precise redshift information for our sources, we 
choose to calculate the angular correlation function and 
then relate it to the real space cor relation function using 
Limber's equations (|Limbeiill954l) . 
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TABLE 1 

Details of the stellar mass limited samples and power law fits to their correlation functions 
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Note. — Stellar masses (SM) are in units of W^'^Mq, the space density (p) has units 10 '^h^'Mpc ^. The errors are 1 sigma. 



3.2. The angular correlation function 

The 2pt-angular correlation function, w{6), is defined 
as the excess probability above Poisson of finding an ob- 
ject at an angular separation 9 from another object. This 
is calculated by comparing the number of pairs as a func- 
tion of angular scale in our galaxy catalogs, with the 
number in a random catalog, which covers the same an- 
gular reg ion as our data. We m ake this measurement 
usmg the lLandv fc Szalavl (|1993[ ) estimator. 



w{9) = 



RR{e) 



DD(0) I ^ 
no 



2DR{0) { — ]+ RR{6) 
no 



(1) 

where DD[9), DR{9) and RR{9) are data-data, data- 
random and random-random pair counts respectively, 
and no and ur are number of galaxies in the data and 
random catalogs. 

We generate random catalogs for each galaxy sample 
following the angular masks of the survey, i.e. excluding 
areas with Wmin < 0.3 and those around bright stars. 
The random catalog has a constant space density and 
at least 20 times the number of random points as data 
points. 

As discussed in Section [3Tl we associate a weight with 
each galaxy calculated as the fraction of the photometric 
redshift PDF that lies within a given redshift slice. When 
calculating the correlation function each pair is weighted 
as the multiple of the weights of each galaxy in the pair 
and the pair count is then the sum of the weights over 
all pairs in the angular bin. Similarly the normalization 
factor ud is the sum of the weights of all galaxies in 
the sample and is given in Table [TJ This means that 
galaxies that are most likely to lie within our desired 
redshift interval are given more weight in the correlation 
function calculation, which should lead to a higher signal 
to noise measurement. This scheme i s in e ssence similar 
to the one proposed bv'M vers et al.l (|2009[ ). 

Our weighting is equivalent to a Monte Carlo approach 
of randomly assigning a redshift to each galaxy based on 
its PDF, applying the redshift cut, calculating the cor- 
relation function, repeating many times and finally cal- 
culating the mean of all these correlation functions. We 
verify this by applying this procedure to one of our galaxy 



samples calculating the correlation function 100 times 
and find the mean correlation function agrees with the 
weighed correlation function to within 1% on all scales. 
We find a variance of <10% on all scales which indicates 
the expected error due to the photometric redshifts if one 
were to use a single sample that utilized the best photo- 
metric redshift estimate. The weighting scheme should 
yield errors due to the photometric redshift uncertainties 
somewhat smaller than this. 

We note that if we just use the best photometric red- 
shift rather the PDF, selecting all galaxies that have a 
best photometric redshift within a given redshift range 
and using no weight, the resulting angular correlation 
functions are almost identical to those calculated with 
the PDF weighting. This is consistent with the error 
estimated from the Monte-Carlo approach. This again 
reflects the accuracy of our photometric redshifts since 
most of the galaxies selected in this manner have all of 
their redshift PDF within the redshift range. 

The calculated angular correlation functions for the 
lowest stellar mass limited samples in all three redshift 
ranges are shown in Figure [2] They show the characteris- 
tic power law shape, with evidence of a break at IMpc 
becoming more apparent as higher redshift as expected 
by the halo model (see Section [5]). 

3.3. Integral constraint 

Since our fields cover a relatively small area, we expect 
the integral constraint (jGroth fc Peebleslll977D to have a 
significant effect on our clustering measurements, lead- 
ing to the underestimation of the clustering strength by 
a constant factor (/C) . IC is equal to the fractional vari- 
ance of the galaxy counts on the size of the field. There- 
fore, the magnitude of IC depends on both the field size 
and the clustering strength of the sample, increasing as 
the field siz e decrease and the cl ustering increases. 

Following 'Infant(^ (j 19941) and IRoche et al.1 ()1999D we 
numerically estimate IC using 



IC 



Y..^w{9,)RR{9,) 



(2) 



This estimate is often made using an iterative process: 
A model for w{9i) is fit to the observed w{9) and IC 
is calculated using equation [2l This correction is then 
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applied to the observed w{6), the model re-fit, and IC 
recalculated until there is convergence. It is typical to 
assume a power-law as the functional form of w{9), how- 
ever, unless the slope is fixed this iterative process tends 
to produce large values of IC and flat slopes. We there- 
fore choose to determine the integral constraint when 
fitting the halo model (see Section [SJ . The halo model 
almost exactly reproduces the shape of the correlation 
function, something a simple power-law does not. Dur- 
ing the fitting process IC is calculated for each model 
correlation function and subtracted from the model be- 
fore being fit to the data. As a test of our procedure we 
also estimated IC using the mock galaxy catalogs we gen- 
erated from the Millennium simulation to calculate the 
errors on the correlation function (see Section [3?5|) . The 
difference between the correlation function calculated for 
the full simulation and the mean of the correlation func- 
tions in the multiple sub- regions the size of our fields give 
a direct measurement of IC. We find that both methods 
give consistent results with values ranging from 0.0150 ± 
0.0005 to 0.033 ± 0.001. 

3.4. The Real Space Correlation Function 

Given a redshift distribution, the angular correla- 
tion function can be directly determined from the real 
space correlat ion function, ^(r), using Limber's equation 
(|LimbeHll953 ) 

w{d) = - / dzH{z) f j / du^ir= y/u^+x^z)9^) 

(3) 

where c is the speed of light, II{z) , the Hubble constant 

at redshift z, is given by II{z) — i/o-\/r2j\/(l + z)-^ + JIa 
assuming a flat universe, dn/dz is the normalized redshift 
distribution, and x{z) is the comoving distance to the 
median redshift. 

Conversely, if a functional form is assumed for the real 
space correlation function then an accurate measurement 
of both the angular correlation function and the redshift 
distribution can be used to determine £,{r). For the case 
where ^(r) is a power law. 



then w{9) is also a power law 



w{e) = A^e^ 

where (5 = 7—1 and 



(4) 



(5) 



^»-fr(i/2)-^ 



dn 



dz H{z) ( — ) x{z) (6) 



with r indicating the gamma function. We make use 
of Equation [3] when fitting the halo model in Section [5] 
and Equation [5] when comparing clustering amplitudes 
in Section m 

In both Equations |3] and [H] dn/dz is the normalized 
redshift distribution for a given sample without any clus- 
tering, i.e. it reflects the selection function of the galaxy 
sample. Large scale structure is clearly visible in our red- 
shift distributions and so we remove its effects by making 



a polynomial fit to each redshift distribution. We note 
that despite the presence of visible structure using the 
fit rather than the measured dn/dz makes essentially no 
difference to any of our results. Even though our photo-z 
errors are small enough to reveal large scale structure the 
rms error still corresponds to ~ 60h~^Mpc, sufficiently 
smearing out the clustering signal on large enough scales 
such that it has a negligible effect. 

3.5. Estimating Errors 

Whether we wish to make comparisons between the 
clustering of our samples or fit models to the clustering 
we need to make an accurate estimate of the measure- 
ment errors and the correlation between the data points 
in the form of a covariance matrix. 

There are a number of ways in which the errors on clus- 
tering measurements maybe estimated; simple Poisson 
errors, internal estimates such as jackknife and bootstrap 
re-sampling and mock galaxy catalogs based on N-body 
simulations or analytical halo distributions. Poisson er- 
rors are known to be an underestimate of the true error, 
particularly on large scales, and do not provide an esti- 
mate of the covariance. 

Both jackknife and bootstrap re-sampling methods 
have been shown to produce reasonable estimates of the 
full covariance (e.g. IZehavi et all [20051) bu t may suffer 
from some systematic issues ( Norberg et al.|[2009() . They 
are also only really effective where the scales of interest 
are significantly smaller than the region of the survey 
being removed in each re-sampling. 

Mock catalogs, in which simulated galaxies have the 
same clustering properties as the real galaxies, may pro- 
vide the best error estimates since large numbers of in- 
dividual surveys can be created and the full covariance 
between them accurately calculated. The only potential 
problem with this method is that there could be higher 
order clustering effects not encoded in the 2-point corre- 
lation function from which the mocks are generated. 
We have chosen to use m ock catalogs generat ed from 



We nave cnosen to use m ocR catalogs generat ed iron 
the Millennium simulation (jSpringel et al.l[2005t ) to gen 
erate the covariance matrices we use for all fits to the 
measured correlation functions. We discuss in Appendix 
[XI the details of our approach, and the reasons for this 
choice. In brief, we first fit a Halo Occupation Distri- 
bution (HOD; see Section [5]) to the correlation function 
and space density of the observed NMBS samples, using 
an estimate of the errors from jackknife re-sampling. We 
then populate halos according to this HOD in the full 
Millennium simulation box at the appropriate redshift 
for the sample. The Millennium box is then split in to 
multiple sub-regions with the same geometry as the sur- 
vey fields and the correlation function is calculated for 
each. The covariance matrix is then generated from the 
correlation functions of each sub-region. 

We note that when comparing the clustering between 
our stellar mass limited samples these errors may well be 
an overestimation since we assume that each measure- 
ment is independent, when in fact it is made within the 
same volume. They are the correct errors if one wished to 
compare with a similar measurement made in a different 
region of sky. We discuss this issue further in Appendix 

m 

3.6. Uncertainties in the redshift PDFs 



Galaxy clustering in the NEWFIRM Medium Band Survey 



comoving separation (h ^ Mpc) 
10 0.1 1 10 0.1 1 




6 (degrees) 

Fig. 2. — The angular correlation function for the AEGIS field (blue), the COSMOS field (red) and both fields combined 
(black) for the lowest mass limit samples in each redshift interval. The top axis shows the angular scale converted to a comoving 
separation at the mean redshift of each sample. All the correlation functions show the typical power law form and do show some 
evidence of a break at around 0.01 degrees, particularly in the highest redshift bin. There is also evidence of cosmic variance 
between the two fields at a level consistent with the magnitude of the errors and given the covariance on large scales. 



As described above, the photometric redshift PDFs 
generated with the EAZY code are used twice in our 
analysis; first, to assign a weight to each galaxy in the 
correlation function calculation, and second to calculate 
the redshift distribution which is used to convert the an- 
gular to the real space correlation function. Although 
these PDFs are calibrated against the thousands of spec- 
troscopic redshifts in our fields, the relative lack of spec- 
troscopic coverage in the 1.2 < z < 2.2 interval of interest 
to this paper means we cannot be absolutely certain of 
their reliability in this range. That being said, we do have 
good reason to expect any uncertainty in the PDFs to 
have a small effect on our measurements. Only galaxies 
that have some part of their PDF outside of the redshift 
range of the sample are being affected by the use of the 
PDFs in our w{9) calculations. Since the redshift inter- 
vals that we select are much broader than the PDFs, the 
number of these galaxies should be small, consisting of 
those lying close to the redshift boundaries or those with 
multiple probability peaks that are widely separated. 

In order to explicitly test the effect of uncertainties in 
the PDFs, we modify the calibration to generate two new 
sets of PDFs that are approximately either half or twice 
the width of the best estimate of the true PDF. While 
we are confident that our calibration is more accurate 
than this factor of two, these new samples will allow us 
to determine the effect in a worst-case scenario. 

We rerun the sample selection, clustering calculations 
and halo model fits (see Section [5]) for all of the samples 
using both new sets of PDFs. We find that, at most, 
the clustering amplitude is changed by 4%. As expected, 
this occurs for the samples with the lowest stellar mass 
limit at each redshift, thus the faintest galaxies, where 
the PDFs are already broadest. When we fit the halo 
model, which includes the effect of the modified PDFs 
on the redshift distribution and the space density as well 



as the clustering, we find that the HOD parameters are 
changed by at most 6%, again for the lowest mass limit. 
It is interesting to note that there is an even smaller 
effect on the derived parameters such as the bias, mean 
halo mass and satellite fraction, which change by no more 
than 2%. 

Since these modified PDFs represent something of a 
worst-case scenario, and any differences are significantly 
less than the measurement errors, we are confident that 
any inaccuracies in the PDFs are not a concern for our 
analysis. 

It is important to note that while the above test 
demonstrates that our analysis is relatively insensitive 
to the photometric redshift PDF accuracy, within rea- 
sonable limits, it does not mean that we could produce 
similar quality measurements of the dependence of clus- 
tering on stellar mass with less accurate photometric red- 
shifts. A reduction in the redshift accuracy would in- 
crease the error on the stellar mass estimates, causing 
our mass limits to be poorly defined. This would cause 
the samples we define to become increasing less close to 
being volume limited, making the standard halo model 
assumptions invalid, and the fraction of catastrophic fail- 
ures would increase, causing systematic variations in the 
redshift and stellar mass distributions. 

3.7. Field to Field Variation 

Figure [2] shows the angular correlation function of the 
lowest mass limited sample in the three redshift intervals 
in both the AEGIS and COSMOS fields separately and 
for both fields combined. The errors plotted in these cor- 
relation functions, and in any further correlation function 
plots, are the square root of the diagonal terms of the co- 
variance matrices generated using the mock catalogs. A 
correction due to the integral constraint calculated from 
the HOD model has been applied. 
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Visually there are differences between the correlation 
functions from the two fields: the COSMOS field shows 
stronger clustering at z = 1.1 on large scales, while the 
opposite is true at z = 1.5. Within these redshift bins 
the same trends continue for the higher mass limited sam- 
ples. This variation is to be expected, as cosmic variance 
will play a role for fields of this size. However, it should 
be noted that the differences in the correlation functions 
between the fields are only of marginal significance, par- 
ticularly when the full covariance is taken into account, 
due to the highly correlated nature of the largest scale 
measurements . 

In the remainder of the paper we will consider only 
measurements from the combined fields, reducing both 
the statistical and cosmic variance errors. 

4. CLUSTERING AS A FUNCTION OF STELLAR 

MASS 

The goal of this paper is to investigate how the clus- 
tering of galaxies depends on galaxy stellar mass, and 
thus determine the relationship between stellar mass and 
dark matter halo mass at redshift 1 < z < 2. In this 
Section we present our 2-point correlation function mea- 
surements for the mass limited samples and then test to 
see if there is a significant dependence on stellar mass. 

Figure [3] shows the 2-point correlation functions for 
all of the stellar mass limited samples in the three red- 
shift intervals. A trend is visible on all scales and at 
all redshifts of an increasing clustering amplitude with 
increasing stellar mass limit. 

When considering angular correlation functions, it is 
always important to remember that the amplitude de- 
pends on both the intrinsic clustering of the population 
(^(r)) and the normalized redshift distribution (equation 
[3]). However, our samples are defined to be very close to 
being volume limited, resulting in the normalized red- 
shift distributions being very similar for all of the mass 
limits within a single redshift interval. It is therefore rea- 
sonable to make a direct comparison between the angular 
clustering measurements in this case. 

To quantify the significance of the stellar mass depen- 
dent clustering, we calculate the between the corre- 
lation functions for the lowest and highest stellar mass 
samples in each redshift bin using the covariance ma- 
trices for both samples. We choose to fit in the range 
0.0011 < d < 0.11 degrees. The lower limit is chosen 
to ensure we are not being affected by any remaining 
deblending issues; the upper limit represents where the 
data become very poorly measured and highly correlated 
with the smaller scale points. 

This test shows that in all three redshift intervals 
there is a significant difference between the high and low 
stellar mass correlation functions. The most significant 
difference is for the z = 1.1 sample where we find — 
22.87 with 10 degrees of freedom (dof) and a probabil- 
ity of 1.1% of the high and low stellar mass clustering 
measurements being the same. At z = 1.5 we measure 
a = 21.01 with 10 rfo/and a probability of 2.1% and 
at z = 1.9 we measure a — 18.75 with 10 dof and a 
probability of 4.3%. While the highest redshift sample 
has the lowest significance, showing a trend with stellar 
mass at 95%, it also has the smallest stellar mass range, 
a factor of 3.33 compared to factors of 7 and 6 for the z 
= 1.1 and z = 1.5 respectively. 



As we have already discussed, differences in the redshift 
distributions between our mass limited samples could re- 
sult in differences in the amplitude of the clustering that 
are not intrinsic. Even though we do not believe that 
this is a significant issue for these data it is reassuring to 
make tests that are independent of this effect. In Section 
[5] the halo model fits will be free from this issue since the 
redshift distributions are used in the halo model calcu- 
lations. But first we shall use a simpler approach, which 
has long been used in the literature, of modeling the 2- 
point correlation function as a power-law. 

Initially we fit a power-law to wlO) (equation[5]) leaving 
both the normalization (Ay^,) and slope (6) as free param- 
eters. We choose to fit both over the full angular range 
of the correlation function (0.0011 < 9 < 0.11 degrees) 
and over a restricted range (0.011 < 6 < 0.11 degrees) 
which just covers the large scale clustering. Within the 
halo model there are two terms that contribute to the 
overall correlation function: the 1-halo term from pairs 
within halos, and the 2-halo term from pairs between ha- 
los. This results in a characteristic feature at the scale 
of the typical halo size and can often result in a change 
of slope at this scale (pcrhnd & Weinberg 2002). This 
transition is well esta blished in measurem ents of the cor- 
relation function (e.g. lZehavi et al.ll2004D and can be seen 
in the correlation functions shown in Figure [31 

Since we want to investigate how the amplitude of the 
clustering varies with stellar mass we must fix the slope 
and just fit for the amplitude. We find that slopes (6) 
of 0.8 for all scales and 0.6 for large scales are consistent 
with the individual best fits for all of the samples and so 
fit with the slopes fixed to these values. Using the best 
fitting amplitudes, the redshift distributions and equa- 
tion 111 we can calculate the correlation length of the real 
space correlation function (ro). 

Figure |4] shows the best fitting values of ro as a func- 
tion of stellar mass for the three redshift intervals (see 
Table [1] for the values) . Significant trends of an increas- 
ing correlation length with stellar mass limit are present 
in all three redshift intervals and for the fits to both an- 
gular ranges, with the exception of the large scale fit at 
2 = 1.1 where the trend is not very significant. This con- 
firms the results of the direct w{6) comparisons above, 
showing that the clustering strength does depend signif- 
icantly on stellar mass at 1 < z < 2. 

5. HALO MODEL ANALYSIS 

The halo model (see Cooray & Shcth 2002 for a review) 
assumes that the galaxy clustering signal encodes infor- 
mation about the Halo Occupation Distribution (HOD) 
- how the galaxies populate Dark Matter halos - in par- 
ticular, how the HOD depends on halo mass. In essence, 
the HOD describes the probability distribution that a 
halo of a given mass (M) host a certain number (N) of 
galaxies of given type, P{N\M). 

In the halo model, every galaxy is associated with a 
halo and all halos are 200 times the background den- 
sity whatever the mass M of the halo. Sufficiently mas- 
sive halos typically host more than one galaxy. The halo 
model we use distinguishes between the central galaxy in 
a halo, and the other galaxies, which are usually called 
satel lites. This approach is motivated by si mulations 
(e.g. iKravtsov etall 120041: iZheng et al.1 12005D and has 
been a standard assumption of semi-analytic galaxy for- 
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Fig. 3. — The angular correlation function as a function of stellar mass limit in each redshift interval. The the stellar mass 
limits in units of AIq are given in the legend. For clarity error bars are only shown for the lowest and highest stellar mass limits. 
The top axis shows the angular scale converted to a comoving separation at the mean redshift of each sample. The solid lines 
(bottom panel) show the best fitting HOD model fit to both the clustering and density, whereas the dashed lines (top panel) are 
fits to the clustering only (see Section [5] for details). In each redshift interval the clustering amplitude increases as the stellar 
mass increases. The HOD fits show a similar increase in amplitude with stellar mass. When the space density is included in the 
fits the clustering amplitude at large scales is reduced; this is particularly noticeable at z = 1.5 but is present at all redshifts 
(see Section [5. l.ip . 



mation models for many years (e.g.'Baugh"2006'). There 
is now strong observational evidence that these two types 
of galaxies are indeed rather different, and that the 
halo model parameterization of this difference is accu- 
rate (jSkibba. Sheth fc Marti^l2007[ ). 

When considering a sample of galaxies with a fixed 
luminosity or stellar mass limit, an HOD that is close 
to a step function for central galaxies and a power law 
for satellites is a reasonable approximation . We choose 
to use the parameterization introduced by iZheng et al.l 
(|2007| ) which has a soft cut off in the central galaxy 
HOD, allowing for the scatter in the stellar mass halo 
mass relation, and a cut in the satellite power law at low 
halo mass. This five parameter analytic HOD was moti- 
vated by the desir e to match the HO Ds from the simula- 
tions presented in lZheng et al.l (|2005l ) and has since been 
shown to precisely reproduce the measured clustering in 
luminosity limited samples, which favor both the varying 



soft central cut and the satellite cut fZheng et al."200'5 
Brown et all 120081: iBlake et al.l i20 08: Ross & Brunn^ 
20091: iRoss et al.l 120101 : iZehavi et al.ll20Tcit ) . Our cluster- 
ing measurements are not sufficiently precise to accu- 
rately constrain all five parameters simultaneously, and 
so we choose to fix several of them in our analysis (see 
15.11) . However, we choose to keep the five parameter form 
to enable an easier comparison to other work at lower 
redshifts. 

The details of th e halo model calcula tion are similar to 
those presented in I Wake et"al1 (|2008ai rbl) , however there 
are significant differences which we describe here. We 
present a full description of our halo model calculation 
in Appendi x IB] 

Following I Zheng et al.l (|2007l ). the fraction of halos of 
mass M which host centrals is modeled as 



{Nc\M) = I 



1 + erf 



logM - logM„ 

^logM 



(7) 
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Fig. 4. — The spatial correlation length, ro, as a function of 
stellar mass limit in the three redshift ranges. Values of ro 
determined from power law fits to the full correlation function 
with the slope fixed at 1.8 (open circles) and from fits to just 
the large scales with the slope fixed at 1.6 (filled circles) are 
shown. As the stellar mass increases the clustering amplitude 
in real space also increases. 

Only halos which host a central may host satellites. 
In such halos, the number of satellites is drawn from a 
Poisson distribution with mean 



{Ns\M) 



M - Mq 
Ml 



(8) 



Thus, the mean number of galaxies in halos of mass M 

{N\M) = {N,\M)[1 + {N,\M)], (9) 
and the predicted number density of galaxies is 



dMn{M) {N\M), 



(10) 



where n{M) is the halo mass function, for which we 
use the latest parameterization given by [Tinker et al.l 
dlOlOb). 

We further assume t hat the satellite galax ies in a halo 
trace an NFW profile ([Navarro et al.|[l996l) around the 
halo center, and that the halos are biased tracers of the 
dark matter distribution. The halo bias (6(M)) depends 
on halo mass in a way that can be estimated directly from 
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Fig. 5. — An example HOD showing the mean number of 
galaxies per halo as a function of halo mass. The dashed 
line shows the central galaxy distribution, the dotted line 
the satellite distribution and the solid line the total. Values 
of the mass scales Afmin, M[ and Mq are indicated by the 
arrows. The affects of criog m and a are also indicated. In brief 
Mmin and M[ are the mass thresholds for central and satellite 
galaxies respectively, aiog m controls the rate of truncation of 
the central galaxy distribution, a is the slope of the satellite 
power law distribution, and Mq is the cut off mass for the 
satellite power law. 



the halo mass function fSh eth fc Tormenl '1999'). a nd we 
use the most up to date parameterization of Tink er et al.l 

In IWake et al.l (|2008ai rbl) we used the linear theory 
power spectrum [Phinik)) throughout the calculation, 
whereas we now use the non-linear power spectrum at 
the redshift of interest when calculating the 2-halo term. 
We also apply the scale dependent bias and halo exclu- 
sion corrections given bv lTinker et alj (|2005f ). 

With these assumpt ions the halo model for $(r) is com- 
pletely specified (e.g. iCoorav fc ShethI [20021 ) . We then 
calculate w{9) from ^(r) using equation ([3|). 

In addition to ^(r), we are interested in the satellite 
fraction. 



(11) 



Fsat = / dMn{M) {N,\M) {Ns\M)/ng, 



the fraction of the galaxies in a given sample that are 
satellite galaxies in halos, and two measures of the typical 
masses of galaxy host halos: an effective halo mass 



Me// j dMMn{M) {N\M)/ng, 
and the average effective bias factor 



ba = j dMn{M)b{M) {N\M)/ng, 
where h{M) is the halo bias. 



(12) 



(13) 
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We show in Figure [S] an example HOD where we indi- 
cate the effect of each of the five parameters in our model, 
-^min is the mass threshold for central galaxies, and rep- 
resents the halo mass which hosts on average 0.5 galaxies 
above the stellar mass limit, tiiog m determines the cutoff 
profile for the central galaxies with higher values corre- 
sponding to a more gentle cutoff. Bo th Mmin and oiogM 
also have more physical meaning fsee lZheng et"all (|200l 
and iZehavi et al.l ()201C1[) for details). In brief, the value 
of Mmin for a sample of galaxies with a stellar mass limit 
SM^im corresponds to the halo mass that hosts central 
galaxies with a median stellar mass of S Admin ■ ciog m is 
proportional to the scatter in the stellar mass of galax- 
ies living in halos of mass Mmin- The halo occupation of 
satellite galaxies are described by the characteristic mass 
M[ , the slope of the power law a and the low mass cut 
off Mo . 

5.1. HOD Fits 

The HOD defined by equations [7] and [8] contains five 
free parameters: Mmin and uiogM for central galaxies 
and M{, Mq and a for satellite galaxies. Our correlation 
functions are not sufficiently accurate to precisely con- 
strain all five of t hese parameters. In previous studies 
of the HOD (e.gJZheng et al.l [20071: iBrown et al.] [20081: 
IZehavi et al.ll2010D it has been found that Mg ~ Mmin 
and we fix this relationship when fitting the HOD. Fur- 
thermore, we find that both CTiogAf and a are very poorly 
constrained by our measurements. There are reasonably 
strong theoretical arguments based on the distribution of 
sub- halos for a j:^ 1 for all stellar mass limited samples 
([Kravtsov et al.ll2004l: [Zheng et al.ll2005D . which are sup- 
ported by pr ecise clustering me asurements in the local 
universe fe.g. IZehavi et al.ll2010l ). As a result of this and 
since a has some considerable degeneracy with M[, we 
choose to fix a. We find that a = 1 is an acceptable fit 
(< la) for all samples and so we fix it to this value in 
the remainder of the analysis. 

There is also some degeneracy between Mmin and 
ciogM, and since ciogM is so poorly constrained we 
choose to fix it also. There is substantial observational 
evide nce that aiosM in creases with incr easing stellar 
mass (jZheng et al.ll2007HBrown et al.|[2008l: IZehavi etahl 
[2010I) . and we see a hint of this when we allow this pa- 
rameter to be free. However, the trend is not significant 
in our measurements, and we find all samples are consis- 
tent with (TiogAf = 0.15 (< Icr), so we fix it to this value. 
We are thus interested in measuring how the two mass 
thresholds for central and satellite galaxies (Mmin and 
M{) depend on the stellar mass limit of our samples. 

A given HOD predicts both the clustering and space 
density of a galaxy population and so both can be used 
when fitting. The inclusion of the space density provides 
particularly strong constraints on Mmin and since it can 
be expected to be very well measured it is a very useful 
constraint on the HOD overall. The dominant error on 
the space density, like the clustering, is cosmic variance; 
we estimate this error using the same mock catalogs used 
for the clustering error estimates and find that the typical 
error on the space density to be about 15%. 

Tables [Hand [3] give the best fitting values of the HOD 
parameters Mmin and M[ and their la errors for fits to 
just the clustering and the clustering and space density 
simultaneously for all samples. These Tables also con- 
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Fig. 6. — The galaxy bias as a function of stellar mass limit 
in the three redshift intervals. The bias is measured in three 
ways. The blue points are calculated by fitting the non-linear 
dark matter correlation function to the measured correlation 
function on scales > 0.024 degrees. The other two bias mea- 
surements are determined from the full HOD fits, the black 
stars from fits to the clustering and the red squares from fits to 
both the clustering and space density. There is some evidence 
of an oflset between the bias determined from the clustering 
and the bias from the fit to the clustering and space density 
in the sense that the clustering prefers higher bias. This is 
particularly evident at higher redshift and higher stellar mass 
limit. 

tain the derived parameters Mi, the halo mass that on 
average hosts one satellite galaxy, as well as Ug, Fgat, 
Meff, and bg as given by equations ITOl ITTl IT2| and IT3l 
respectively. The best fitting model w{0)s are shown in 
Figure [31 

5.1.1. A Discrepancy Between the Clustering and Space 
Density in the Halo Model? 

If the halo model is a good representation of our data, 
one would expect the fits based on just the clustering 
and those based on the clustering and space density to 
be consistent. Several previous studies of high redshift 
clustering have reported difficulty in simultaneously fit- 
ting both the space density and clustering within the halo 
model. For example, Quadri et al. (2008;Q08) report 
this issue for distant red galaxies (DRGs) at 2 < z < 3 
and most recently .Matsuoka et al., (,2010.) show a simi- 
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TABLE 2 

HOD AND DERIVED PARAMETERS FROM FITS TO THE CLUSTERING ONLY. 
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Note. — Mmin and are the fitted HOD parameters. Mi, rig, bg, Fsat and M^jj are all derived parameters and are respectively 
the mass scale at which a halo hosts 1 satellite on average, the mean galaxy number density, the average linear bias, the satellite fraction, 
and the effective halo mass. The stellar mass (SM) is in units of 10^ Mq, halo masses are in units of lO^^/i^^MQand density has units 
lO-^h^Mpc-^. The errors are 1 sigma marginalized over the other parameters. 



TABLE 3 

HOD AND DERIVED PARAMETERS FROM FITS TO THE CLUSTERING AND DENSITY. 
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Note. — Mmin and M[ are the fitted HOD parameters. Mi, rig, bg, Fsat and M^ff are all derived parameters and are respectively 
the mass scale at which a halo hosts 1 satellite on average, the mean galaxy number density, the average linear bias, the satellite fraction, 
and the effective halo mass. The stellar mass (SM) is in units of lOl A/0, halo masses are in units of 10i^/i~l Af0and density has units 
IQ-^h^Mpc-^. The errors are 1 sigma marginalized over the other fit parameter. 



lar discrepancy for t he most massive galaxies at z < 1. 
[Tinker et all (l'2010al) were able to fit the measured DRG 
clustering and space density from Q08 using a better 
justified HO D model a nd the latest halo mass-bias re- 
lation from [Tinker e"Fa l. (20101^ ), wh ich is steeper at 
higher bias than the IsiTeth et al.l (|2001[ ) relation used by 
Q08. However, they required that the field in Q08 have a 
higher clustering amplitude than average, but were able 
to demonstrate using simulations that this was not that 
unusual due to cosmic variance, with just over 16% of 
their simulated surveys showing clustering as strong as 
observed by Q08. 

Figure [3] and the HOD fit parameters in Tables [2] and |3] 
show a small systematic difference between the observed 
density and clustering and that which is predicted by 



the model. This is particularly noticeable for high stel- 
lar mass (high halo bias) and at z — 1.5 and 1.9. The 
discrepancy is in the sense that the HOD model requires 
a smaller space density to match the clustering than is 
observed. We illustrate this explicitly in two ways in Fig- 
ures [6] and [71 Figure [6] shows the bias as a function of 
stellar mass limit determined in three ways. The stars 
and squares show bg from the HOD fits as given by Equa- 
tion [T3] for fits to the clustering and clustering plus space 
density respectively. The circles show the large scale bias 
(bis) calculated by fitting the non-linear matter correla- 
tion function to the clustering on large scales. Both bis 
and bg (fitted to the clustering) show higher values than 
bg fitted to the clustering and density. A similar trend 
is seen in Figure [7] where we show the predicted mean 
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Fig. 7. — The predicted mean density based on the HOD 
fits to the galaxy clustering as a function of the measured 
space density for all stellar mass limits in the three redshift 
intervals. The vertical solid and dotted error bars are the 1 
and 2 sigma errors on the HOD fit density respectively. The 
dashed line shows the one to one relation. All but one of the 
points lie to the right of the one to one relation meaning that 
the model fit to the clustering implies a lower space density 
than is measured. 



density based on the HOD fit to the clustering compared 
to the measured density. All but one of the points lie 
below the one to one relation, although on both plots all 
are within 2 sigma of the expected relation. 

Allowing the HOD parameters that were fixed to vary 
does not resolve this discrepancy. If we make the satellite 
slope a steeper or shallower within reasonable bounds, 
we find that the satellite mass threshold M[ adjusts to 
compensate keeping the satellite fraction the same and 
hardly changing the space density. The same is true for 
Mcut- Adjusting the softening parameter for the cen- 
tral cut off, CTiogM, does effect the predicted density for 
a given clustering amplitude, in the sense that sharper 
cutoffs, smaller values of aiogM, will produce a higher 
density for the same clustering. Our chosen value of 
ciogM is already quite low and there is some expecta- 
tion that it should be higher for our more massive galaxy 
samples, however even an unphysical instantaneous tran- 
sition does not reduce the discrepancy by much. 

Therefore, in our analysis we do see some evidence of 
this discrepancy over two independent fields, and three 
redshift ranges, even though we are usi ng the latest 
halo m ass function and b ias rela tions from [Tinker et ahl 
(|2005[ ) and [Tinker et alj (|2010b[ ). as well as the most up 
to date implementation of the HOD and clustering calcu- 
lation. However, we should emphasize that the individ- 
ual HOD fits that include the space density are perfectly 
acceptable fits and are only slightly worse than the fits to 
clustering alone. It is still striking that we see that this 
systematic offset is apparent for nearly all of our sam- 
ples and particularly those with high bias. The simula- 



tions that we conduct to estimate our errors do contain 
some fields with clustering as strong as we observed for 
our most massive samples at z = 1.5 and 1.9, but these 
are rare (Figure [T^. Even though cosmic variance could 
well be playing some part, it is challenging to explain the 
whole of this systematic trend as cosmic variance, partic- 
ularly when the very similar findings of iMatsuoka et ahl 
(|20T0l) at z < 1 are considered. 

One possible explanation is that there may be a deficit 
of highly biased halos in our model, as a result of ei- 
ther the halo mass function or the halo bias relation that 
we use. We find that modifying the halo bias relation 
to make it slightly steeper at high halo masses resolves 
this problem, although this is a purely arbitrary adjust- 
ment. There are several different calibrations of the halo 
bias relation in th e literature (e.g . Sheth & Tormen 19& 



Sheth et al." 2001; 'Hamana et al. 2001; Seliak fc Warre: 
2004; .Wetzel et al.. .2007.: .Tinker et al. 2010b.), and it is 



of course at the high bias end of this relation, where the 
halos are rarest, and thus the errors largest that we are 
potentially se eing a difference with the observations. We 
note that the [Tinker et al.l (|2010b) bias relation that we 
use here minimizes this discrepancy compared to other 
bias relations in the literature. It is thus possible that 
further calibration of this relation in the high bias regime 
may still be needed, along with more precise clustering 
measurements of sufficiently biased galaxy populations. 

The high bias regime is one that has yet to be accu- 
rately probed with clustering measurements. At lower 
redshifts even the most massive galaxies occupy less 
biased halos. For instance, we could accurately pre- 
dict the space density for the 5700 b rightest Luminous 
Red Galaxies ([E isenstein_etal|[200l in the SDSS DR7 
(lAbazajia n et al.l 12009) by fitting our HOD model to 
their projec t ed 2pt -correlation function calculated as in 
iWake et al.l (j2008a[) . However, even these massive galax- 
ies have a bias of 3, which falls at the lower end of the 
bias values implied by the clustering of our samples where 
we see the predicted and observed densities become most 
discrepant. 

Another possibility lies not with the halos themselves 
but how the galaxies are placed within the halos. We 
make the standard assumption that the satellite galax- 
ies trace an NEW profile within each halo. If this were 
not the case, for instance if the satellites had a broader 
distribution, it may be possible to place more satellite 
galaxies in massive halos, thus increasing the large-scale 
clustering amplitude whilst maintaining the small scale 
amplitude at a level consistent with observations. 

It will be very interesting to see if this problem persists 
with larger surveys in more fields at these redshifts, such 
as the forthcoming NMBS-II, or if we have just been very 
unlucky with cosmic variance in this instance. 

Because of this potential issue we will present the re- 
mainder of our results for both fits to the clustering alone 
and to fits combining clustering and space density. The 
overall trends are the same but there is a stronger stellar 
mass-to-halo mass dependence for the fits to the clus- 
tering alone. However, we will focus mainly on the fits 
that include the space density. Its inclusion provides a 
very strong constraint on the HOD, particularly when 
the small scale clustering is as well measured as it is 
here. We are comfortable doing this since while there 
is a hint of a discrepancy within the model it is not yet 
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large enough to exclude the fits including the density at 
a high significance. 

6. THE STELLAR MASS-TO-HALO MASS 
RELATIONSHIP 

Figure |8] shows the HODs (for the fits to the cluster- 
ing and density), and Figure [9] shows the relationship 
between the halo mass scale parameters for central and 
satellite galaxies, Mmin and Mi, as a function of stel- 
lar mass limit for fits to the clustering (bottom) and the 
clustering and space density (top). Once again the clear 
trend of increasing halo mass with increasing stellar mass 
is visible at all redshifts, with the effective halo mass in- 
creasing by a factors of 2.2 ± 0.2, 2.5 ± 0.3, and 5.7 ± 
0.4 per decade in stellar mass at redshifts 1.1, 1.5 and 1.9 
respectively. The fits that include the density in Figure |9] 
show much smaller errors due to the precision of the den- 
sity measurement and it's constraining power. They also 
typically prefer lower values of the halo mass thresholds 
{Mynin and M[) as discussed in Section lS.l.ll 

Where there is overlap between the stellar mass bins 
there is very little evidence for evolution in the HODs 
with redshift. Only the highest stellar mass bin that is 
in all three redshift ranges shows significant evolution 
in iWmin with redshift, such that Af,„in increases with 
d ecreasing red s hift. 

iZheng et al.l ()2005[ ) show that for the definition of the 
HOD for centrals used here (equation!?]), the median stel- 
lar mass of central galaxies living in halos of mass Mmin is 
the stellar mass limit of the sample. It therefore appears 
that the median stellar mass of central galaxies at a given 
halo mass remains approximately constant as a function 
of redshift between redshifts 1 and 2. Since the halo mass 
is evolving, such that a halo at z — 1.9 will have a higher 
mass at z = 1.1, we must be seeing growth in the central 
galaxy m ass at close to the s ame rate as the halo mass is 
growing. iBrown et all (f200l see a similar very slow evo- 
lution in the halo mass stellar m ass relationship for red 
galaxies at 2 < 1 in the NDWFS. IZheng et al.1 (|2005| ) do 
observe a small amount of evolution in the halo mass lu- 
minosity relationship for central galaxies at z < 1, using 
samples from the SDSS and DEEP2, although the use 
of luminosities measured in different rest frame bands 
c omplicates the inter pretation. 

iZehavi et al.l (|2010D propose a parameterization of the 
stellar mass-to-halo mass relation for central galaxies 
consisting of a power-law component at high halo masses 
and a exponentially declining component at low halo 
masses given by 

where A, Mt and aM are free parameters. aM is the 
power-law slope, Mt is the transition halo mass marking 
the point where the dominance of the two components 
switch, and A is the normalization giving the median 
stellar mass at Mj. This form is well fit to the HOD 
measurements at z ~ 0.1 where there is a clear shoulder 
in the stellar mass-to-halo mas s relationship at around 
10^2 /t-iM^ (IZehavi et al.ll20lol) . 

We fit this relation to the data shown in Figure [H] 
and find best fit values oi A = 2Atii x 1O^°M0, 



an = 0.741^:^° and Mt = 172t^g x 10^° Mq shown 
as the solid line. Due to the limited range in stellar 
mass that our samples cover this fit is quite poorly con- 
strained, although we might expect to be covering the 
transition region. We find that the best fit transition 
mass at 1 < z < 2 is a f actor of 4 higher th an that found 
at z - 0.1 in the SDSS (|Zehavi et al.|[20Tol) . 

6.1. The Halo Occupation of Satellite Galaxies 

Figure ini shows both the central mass scale Mmin and 
the satellite mass scale A/i as a function of stellar mass, 
where Mmin is the halo mass at which a halo hosts on av- 
erage 0.5 central galaxies and Mi is the halo mass hosting 
on average 1 satellite galaxy. The dashed line in Figure 
[9] shows the best fit relation to Mmin scaled by a factor 
of 6, which appears to fit the Mi stellar mass relation 
pretty well. This offset implies that a halo hosting two 
galaxies, one central and one satellite, is typically more 
than 6 times as massive as a halo hosting just one cen- 
tral galaxy. Halos with masses in between A'/min and Mi 
will typically host a single central galaxy with an above 
average stellar mass. 

Observ ations from the SDSS at z ^ 0.1 fin d Mi ~ 
17Mmin (|Zheng et al.ll2007t [Zehavi et al.ll2010D . almost 
three times the difference we observe at z > 1. There are 
two reasons for this: First, our samples have higher stel- 
lar mass limits than the typical samples from the SDSS, 
and second, we expect there to be evolution with redshift. 
The stellar mass limit is important as the Mi /Mmin ratio 
is seen to become rapidly smaller for highe r stellar mass 
(luminosity) limits (e.g. see Figure 12 of IZehavi et al.l 
I2OIQ) . This dependence on halo mass is thought to be 
caused by the fact that the more massive the halo the 
later it is expected to form at any given epoch, so that 
there is less time for satellites to merge on to the central 
galaxy in more massive halos. 

Even though the samples in IZehavi et al.l (|2010[ ) are 
luminosity limited, rather than mass limited, and at a 
different redshift we can make a reasonably good com- 
parison by matching the space densities. In doing this 
we are assuming that the N most massive galaxies at one 
redshift correspond to the N most massive at another. 
At over-lapping space densities the Mi /Mmin ratio varies 
from 17 to 6, with a typical value of ~ 10 at z ~ 0.1, and 
is much more consistent with our values for the lowest 
space densities (highest masses) . We don't see much of a 
dependence of this ratio on stellar mass within our data, 
which could simply be the result of the relatively small 
stellar mass range of our samples or it could show a gen- 
uine reduction in the stellar mass dependence at high 
redshift. 

The expected redshift dependence has a similar ori- 
gin as the stellar mass dependence, in that for any 
given halo mass the time with which satellites have to 
merge with the centr al galaxy is less at earlier epochs. 
iKravtsov et al.l ()2004[ ) use high resolution dissapationless 
N-body simulations to investigate the HOD and predict 
that Mi/Afmin should have 2/3 of its z = value by z = 1 
and 1/3 by z = 3. This prediction is roughly consistent 
with the reduction we observe compared to z ~ 0.1. 

Figure [TO] shows the relationship between satellite frac- 
tion and stellar mass limit for the HOD fits to the clus- 
tering {bottom) and the clustering and density combined 
{top). The HOD fit that includes the density shows 
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Fig. 8. — The mean number of galaxies per halo as a function of halo mass, the HOD, (top) and the mean number of galaxies 
per halo times the number density of halos as a function of halo mass (bottom). The total, central and satellite contributions 
are shown by the solid, dashed and dotted lines respectively. In each redshift range the typical halo mass increases as the stellar 



mass mcreases. 



higher satellite fractions than the fits to the cluster- 
ing alone, although for most samples the significance is 
marginal. The reason for this difference is simple, for a 
given space density HODs with high satellite fractions 
are preferentially placing galaxies as satellites in high 
mass halos, which are more clustered, rather than cen- 
trals in lower mass halos, which are less clustered. For 
our samples, this allows the clustering to remain high, 
even though the space density is large. Of course there 
is a limit to how many satellite galaxies are allowed, de- 
termined by the amplitude of the small scale clustering. 

Both fits show a general trend of decreasing satel- 
lite fraction with increasing stellar mass limit, a trend 
which has previously been seen in clustering measure- 
ments, group catalogs and from len sing at z < 1 (e.g. 
Zehavi et al1[200 5: Zhe ng et al.ll2007HMandelbaum et atl 



limits less than 2 x lO^^M, 



0- 



20061 1 Yang et al.ll2008lV This trend simply results from 



the shape of the halo mass function; at high halo masses, 
the halo mass function has an exponential decline, so for 
a fixed ratio of Mmin-to-Mi halos with mass Mi are in- 
creasingly rare relative to halos of mass Mj^in as these 
mass scales increase. At lower halo masses the mass func- 
tion flattens and the rate at which the satellite fraction 
changes decreases as the stellar mass limit decreases. 
This flattening in t he satellite fraction is observed at 
z ^ 0.1 in the SPS S (IZheng et al.ll2007HYang eralll200l 
IZehavi et al.ll20l"ol) and there is evidence of this flatten- 
ing in the measurements presented here for stellar mass 



6.2. 



Star formation efficiency in central galaxies 

A more physically intuitive way to view the stellar 
mass-to-halo mass relation for central galaxies is to con- 
sider how the ratio of the central galaxy stellar mass- 
to-halo mass depends on halo mass. Assuming that the 
baryon fraction is constant with halo mass then this ratio 
gives the efficiency with which baryons have been con- 
verted to stellar mass in the central galaxy. The top 
panel of Figure [TT] shows this ratio as a function of halo 
mass for our three samples along with the analytic rela- 
tion derived from equation [TH (Zehavi et al. 2010) 



A 



Mt 



exp 



Mh 



- 1 



(15) 



with A, Mt and um the best fit parameters from be- 
fore. Here A/Mt is the halo mass-to-stellar mass ratio 
at Mt- This analytic relation is shown as the solid line 
in Figure [TT] with the \a confidence region shown by the 
dashed lines. We see a steady rise in the star formation 
efficiency as halo mass increases, which plateaus at high 
halo masses and shows a hint of a turning over in the 
highest mass bin. We see little evidence of redshift evo- 
lution within our sample, although the turnover may be 
occurring slightly early at lower redshift. 
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Stellar Mass (10^" M^) 

Fig. 9. — The relationship between steUar mass and halo 
mass for the central and satellite galaxies from the HOD fits to 
the clustering (bottom) and the clustering and density (top). 
The solid lines in both panels show the best fit stellar mass 
Mmin relation of the form given in equation[T3]to the cluster- 
ing and density HOD fits. The dashed lines show the Mmin 
relation scaled up by a factor of 6. Higher mass thresholds are 
typically preferred by the fits to just the clustering than by 
the fits to both the clustering and density. Both the satellite 
and central halo mass thresholds increase as the stellar mass 
limit increases. 



In the middle and bottom panels of Figure [TT] we 
compare to other measurements from the literature at 
z = 0.1, z = 1, z = 1.5 and z = 2. T he open crosses and 
circles in the middle panel are from iZheng et all (|2007[ ) 
and are based on HOD fits to clustering measurements 
in the SDSS and DEEP2 surveys. The blue and cyan 
shaded area in the middle panel show the la confidence 
interval of this relation derived by sub-halo abundance 
matching (SHAM) to the SDSS stellar mass function at 
z = 0.1 and stellar mass function measurements from 
iFontana et al.l (I2006D in the Chancl ra Deep Field South 
at z = 1.5 from lMoster et al.l ()2010D . 

The bottom panel again shows further SHAM measure- 
ments from Bchroozi et al. (2010) who again make use of 
the SDSS stellar mass function at z = 0.1, but use more 
recent stell ar mass functions f rom [ Perez-Gonzalez et al.l 
(|2008f) and iMarchesini et al.l (|2009D at higher redshift. 
The solid errors on these points show the random errors 
on the measurements with the dotted errors showing the 
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Fig. 10. — The relationship between the fraction of galaxies 
above a stellar mass limit that are satellites in halos (rather 
than centrals) and the stellar mass limit determined from the 
HOD fits to the clustering (bottom) and the clustering and 
density (top). In both panels the satellite fraction appears to 
reduce as the stellar mass increases. Including the density in 
the fit results in higher satellite fractions. 



systematic errors. 

All of the litera ture measurements use 
iBruzual fc Chariot! ()2003() s tellar po p ulatio n syn- 
thesis models rather than the iMarastonI ()2005[ ) models 
that we have used. We have thus applied a correction to 
our stellar masses in the middle and bottom panels to 
account for this difference. Bchroozi et al. (2010 ) also 
assume a d ust model from Blanton fc Roweia (|2007D 
rather than iCalzetti et al.l ()2000D and so we further 
adjust of results in the bottom panel to reflect this. 

All three SDSS relations show a characteristic max- 
imum star formation efficiency at a halo mass of 
around 6 x 10^^ h~^MQwith a steep decline either 
side, corresponding to a fall in efficiency at higher 
or lower halo mass. This peak efficiency has also 
been seen in several other studies at low redshift using 
the HOD, the Conditional Luminosit y Function, group 
catalogs, SHAM and weak lensing (lYang et al.l 2003 ; 



Eke et al.ll2005tlTinker et al.ll200,5tlVare fc Ostrikeril200(i 



Mandelbaum et al.ll200a iGuo et al.ll2010l) . 

When compared with these z ~ results our mea- 
surements show that the halo mass-to-stellar mass vs. 
halo mass relation has changed significantly with red- 
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Fig. 11. — Top panel: The relationship between the ratio of 
stellar mass-to-halo mass and halo mass for central galaxies 
determined from the HOD fits to the clustering and density. 
This ratio is proportional to the efficiency with which baryons 
have been converted to stars. The solid line shows the best fit 
relation of the form given in equation[T2]determined from the 
fit to the stellar mass-to- Mmin relation shown in Figure[8l The 
dashed lines show the la confidence region on this fit. The 
star formation efficiency increases as the halo mass increases, 
before plateauing at high halo mass. Middle and bottom pan- 
els: Comparisons to simila r measurements in t h e literature at 
z = q .l, 1, 1.5 and 2 from IZheng et all (120071'). IMoster et all 
||2010D (middle panel) and IBehroozi et al.l (|2010l ') (bottom 
panel). We have adjusted our measurements in these panels 
to match the stellar population modeling assumptions used in 
the literature determinations (see text for details). The peak 
in the relation, which corresponds to the peak in efficiency in 
the conversion of baryons to stars in the central galaxy, has 
shifted to higher halo masses at higher redshift. 

shift, with the peak in the star formation efficiency mov- 
ing to higher halo masses at higher redshift. It is unfor- 
tunate that the area of our survey does not permit us 
to probe to slightly higher stellar masses, and thus halo 
masses, as it is not clear that we are definitely seeing 
the expected down turn in M^/M^ at high halo masses. 
However, there is a definite fiattening in all three redshift 
ranges and some evidence of a down turn in the highest 
mass bin in the z = 1.9 sample. This turn over becomes 
more apparent in the bottom panel after the corrections 



to stellar population model and dust law are applied. 

The literature measurements also show a shift in the 
peak efficiency to higher halo masses at higher redshift. 
These are broadly consistent with our measurements, al- 
though our peak efficiency is at somewhat higher halo 
masses and the rate of down turn at low halo masses 
appears to be faster. 

It is not surprising that th ere is not an exact agree- 
ment with IZheng et al.l ()2007f) since they use luminosity 
limited samples, limited in the rest B band at z ^ 1, 
resulting in a color dependent selection that is not stel- 
lar mass limited. Cosmic variance caused by the small 
field used for the st ellar mass function measurement in 
IMoster et al.l ()2010D . along with small differences in the 
stellar mass determinations and cosmology could easily 
explain the difference between our relations. 

Our measurement at z = 1 lies ju s t abo ut within 
the systematic error of IBehroozi et all ()2010t ). however 
we have adjusted our stellar masses to match their 
model, and uncertainties in the stellar population synthe- 
sis model masses are the dominant factor in their system- 
atic error determination. We assume the same cosmology 
and whilst there are slight differences in our halo mass 
definitions one would expect better agreement, particu- 
larly as the NMBS data produce stellar mass functions 
which very closely match those used by IBehroozi et al.l 
(|201C1[) . The largest difference is for lowest stellar mass 
limited samples where our best fit halo masses woul d 
need to be about 30% lower than IBehroozi et al.l (|2010( ). 
The cause of this difference appears to be the fact that 
we are fitting the clustering as well as the space den- 
sity of galaxies, whereas the SHAM just uses the space 
density of galaxies. For a given stellar mass limit we 
could easily reduce the value of Mmin in the halo model 
(which gives the median central stellar mass) if we made 
M[ higher thus reducing the satellite fraction. However, 
this would reduce the clustering amplitude on both small 
and large scales and be incompatible with our clustering 
measurements. At lower stellar masses the SHAM model 
is assigning fewer galaxies to be satellites and more to be 
centrals than appears to be compatible with our clus- 
tering measurements. This illustrates the importance of 
testing the assumptions that go into the SHAM model 
with additional observations such as the clustering. 

Returning to the redshift evolution of the star for- 
mation efficiency the shift in the overall normalization 
of this relation just refiects the continued conversion 
of baryons into stars. If we assume a universal baryon 
fraction of 16.9% then we can convert the halo mass-to- 
stellar mass ratio to the fraction of baryons converted to 
stars in the central galaxy. At z 1.5 this stellar bary- 
onic fraction peaks at about 10%, whereas at z = it 
peaks at 25%. At high halo masses the high redshift and 
low redshift relations approach each other. This means 
that central galaxies in high mass halos are growing 
much more slowly in stellar mass than their host halos 
are growing in dark matter mass. It does not mean that 
the total fraction of baryons in stars has hardly changed, 
just that these stars are not being effectively added to 
the central galaxy and may be in either satellites or in 
the diffuse intra halo background. The opposite seems 
to be the case at lower halo masses where there appears 
to be an increasing rate of stellar mass build up for 
centrals, compared to the rate of growth of their parent 
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halos. This is in a sense another manifestation of the 
'downsizing' paradigm, but for halos. The most massive 
halos are more efficient in converting baryons to stars at 
high redshift and are becoming increasingly inefficient 
in doing so as time goes on compared to lower mass halos. 



7. SUMMARY AND CONCLUSIONS 

We present here a detailed analysis of the clustering of 
galaxies as a function of their stellar mass at 1 < z < 2 
using data from the NEWFIRM Medium Band Survey. 
The precise nature of the NMBS photometric redshifts 
allows us to define samples that are very close to being 
volume limited with accurate stellar mass estimates. We 
find the following: 

(i) In all three redshift slices we see a significant de- 
pendence of the clustering amplitude on the stellar mass 
limit of the sample, both when comparing the 2-point 
angular correlation functions directly, or when consider- 
ing the amplitude of a power law fit to the correlation 
function measurements. This shows that the strong stel- 
lar mass dependent clustering seen at < 1 persists up 
to z 2. 

(ii) Wc fit halo models to our meas u remen ts using the 
form of the HOD from IZheng et all (j2005D . fitting for 
the central and satellite mass thresholds Mmin and M[ 
respectively. We find that both Mmin and M{ show a sig- 
nificant increase with increasing stellar mass limit, con- 
firming that at strong stellar mass-to-halo mass relation- 
ship is in place at 1 < z < 2. 

(iii) For the HOD definition that we use, the stellar 
mass limit of our samples corresponds to the median stel- 
lar mass of central galaxies hosted by halos of mass Mmin- 
We see little evidence for any evolution in this relation- 
ship between central stellar mass and halo mass within 
our sampled redshift range, although there is clear evi- 
dence that it evolves from the redshift zero relation. 

(iv) We determine the efficiency with which baryons 
are converted into stars in central galaxies as a function 
of halo mass by calculating the stellar mass-to-halo mass 
ratio. We see a peak star formation efficiency in halos 
of mass ~ 3 X 10^^ h~^MQ with a clear decrease in effi- 
ciency at low halo masses. There is some weak evidence 
of a downturn at high halo masses, but this conclusion is 
limited since the size of our survey prevents us from prob- 
ing stellar mass limits greater than 10^^ Mq due to the 
small number of galaxies with these high masses. Mea- 
surements at z ~ show a peak efficiency in halos of 
~ 7 X lO^^/i~^M0, providing clear evidence of a shift in 
peak efficiency to higher halo masses at higher redshift. 
This halo 'downsizing' is a similar phenomenon as the 
galaxy 'downsizing' seen as a function of galaxy mass. 

(v) We find evidence that the fraction of satellite galax- 
ies increases as the stellar mass limit decreases in a simi- 
lar manner to that seen at z ~ 0. The ratio between the 
central and satellite mass thresholds Mmin and M{ re- 
mains approximately constant over the relatively narrow 



range in stellar mass that we probe. The ratio does ap- 
pear to have decreased compared with the z ~ values, 
consistent with the basic expectations of N-body simula- 
tions. 

(vi) We show some evidence that the halo model is un- 
able to reconcile both the observed clustering and space 
density of highly biased galaxies at z > 1. The signifi- 
cance of this is marginal, and it is possible that we could 
have been unlucky with cosmic va riance; but taken with 
other similar recent observations (iMats uoka et al.ll2010( ) 
it may imply that there is an incompleteness in the halo 
model, for instance the halo bias relation may need to be 
adjusted in the high bias regime or galaxies may not be 
distributed within halos following an NFW profile. 

There remains much scope for furthering this work in 
the near future. The forthcoming NEWFIRM Medium 
Band Survey II, which will cover 10 times the area as 
NMBS in 5 fields, but to shallower fiux limits, will enable 
us to extend this work to higher stellar masses. This 
should allow the down turn in the stellar mass-to-halo 
mass ratio halo mass relation to be precisely pin-pointed 
as well as resolve whether there is an issue with the halo 
model at high redshift and high bias. 

The ultimate goal of this work will be to use the halo 
model framework to place constraints on the evolution 
of the galaxy population, by combining these measure- 
ments with those at lower re dshifts as has b een done by 
ej?. Zheng ct al. (2007); Whi te et all (pOOTi ) ; Imke et al.l 
p008^); IBrown et al.l ((2008) . We have aheady begun 
this process and in a forthcoming paper we will combine 
the measurements presented here at 1 < z < 2 with mea- 
surements of clustering as a function of stellar mass in 
the SDSS at z = 0.1, where we have defined the stellar 
mass in a consistent manner (Wake et al. 2010 in prep.). 
This will allow us to see if the stellar mass halo mass 
relation remains constant over this range and, by com- 
bining it's evolution with the evolution of the halo mass, 
determine how central galaxies have grown as a function 
of cosmic time over the last lOGyrs. 
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Fig. 12. — The measured and mock angular correlation functions for the most and least massive galaxies in the z — 1.5 sample. 
The blue points show the measurements from the NMBS, the black squares the full box mock, the red stars the mean of the 
individual survey mocks, and the cyan lines the individual survey mocks. All have been corrected for the integral constraint 
where appropriate. 



APPENDIX 

ESTIMATING ERRORS ON THE 2-POINT CORRELATION FUNCTION 

Making an accurate estimation of the errors on a correlation function measurement is particularly challenging, and 
there are several established methods in the literature, all of which have advantages and disadvantages. Since the 
individual data points in any correlation function are highly correlated, particularly on large scales, it is vital to 
estimate the full covariance when determining the errors. This immediately rules out the use of Poisson errors, which 
despite these problems are still often used. 

This leaves two broad classes of method: internal estimators such as bootstrap or jackknife resampling techniques 
that make use of the data themselves, and mock catalogs, whereby multiple fake data sets are created and compared. 
The advantage of the internal estimators, is that they include all of the real correlations in the data, something which 
mock catalogs may not. The disadvantage is that they are of limited size and so can produce noisy estimates of the 
covariance matrix an d may also show dependence on the numbers and size of the sub-regions used when re-sampling 
()Norberg et al.|[2009t) . 

We made an initial attempt to use the jackknife resampling technique to estimate the covariance on our measure- 
ments. We define multiple sub-regions with equal area and then recalculate the correlation function removing one 
subregion at a time to calculate the covariance. We perform this procedure multiple times splitting our data in to 9, 
25, 49 and 100 sub-regions. Comparing the covariances we find a strong dependence of the magnitude of the errors on 
the number of sub-regions we use in the jackknife calculation, particularly on large scales, an effect reported previously 
in Norberg et al. (2009; but see also Zehavi et al. 2005). The strong large scale dependence is not surprising since the 
size of our sub-regions is comparable or smaller than the scale at which we wish to estimate the error. However, we 
still see a systematic correlation at smaller scales, such that the errors increase as the number of sub-samples increases. 
For this reason we believe that mock catalogs pr ovide a significantly better approach for our analysis. 

We choose to use the Millennium simulation (jSpringel et al.l [20051 ) to generate our mock surveys, which is almost 
ideal for our purposes. For each redshift sample we download a halo catalog with halo masses > 7 x lO^'^h^-^ Mq at 
the output with the redshift closest to the mean of our sample, redshifts 1.08, 1.50 and 1.91 respectively. 

We then make a HOD fit to the correlation function and density of each of our samples using the method described 
in Section [51 We make use of the covariance matrices generated using the jackknife method with 9 sub-samples. 
There is considerable noise in these covariance matrices due to the small number of jackknifes and we mitigate this by 
restricting the fit to use just 95% of the covariance based on a principle component analysis following the techniques 
proposed bv IPorciani fc Norbergl (|2006( ): iNorberg et all (|2009| ). This is more than adequate to generate the mocks, 
where we are just interested in the best fitting HOD and not concerned with confidences etc. 

We fit to both the density and clustering as it is important that we get the best match to both in the mocks. If we 
just fit to the clustering, something we tested, we see that the density in the mocks is much lower than in our samples, 
particularly at high stellar masses and at z = 1.5. This leads to a substantial over estimation of the errors simply as 
a result of increased shot noise. 

The Millennium simulation has as = 0.9, where as the currently favored value, and the one we assume, is 0.8. When 
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fitting the halo model in this instance we therefore use erg = 0.9. Despite this difference the resulting mocks are able 
to reproduce the clustering and space density of our samples with the same accuracy as the sigmas — 0.8 HOD fits 
and so we are happy to use these to estimate our errors. 

The dark matter halo catalogs extracted from the Millennium simulation are then populated following the best fitting 
HOD for each sample, placing the central galaxies at the center of the halos and distributing the satellites following 
an NFW profile. The whole Millennium box is then split up into multiple copies of our full survey geometry. The 
Millennium box size of 500h~^Mpc is not quite large enough to accommodate the full size of our samples in the redshift 
direction and so we wrap the box in order to accomplish this. We are able to generate 170, 128, and 98 realizations of 
a single field at redshifts 1.1, 1.5 and 1.9 respectively. When generating the mocks for both fields combined we ensure 
that each pair of fields is separated by at least 100h~^Mpc so they are not correlated. 

The angular correlation function is then calculated for the full box mock, and in each of the survey mocks. The 
covariance for each stellar mass limited sample is determined from the multiple survey mocks. We make a direct 
estimate of the integral constraint for the mock surveys by comparing the mean correlation function of the survey 
mocks to the correlation function of the full box mock. 

Figure [T2] shows the results of this process for the lowest and highest stellar mass limited samples at z — 1.5 with 
both fields combined. The blue points on each plot show the correlation function measurement for the NMBS sample, 
the black squares show the correlation function from the full box mock, and the red stars indicate the mean of the 
individual survey mocks corrected by the integral constraint. The cyan lines show the individual correlation function 
measurements from each survey mock. In all cases the mean survey mocks agree with the full box mock as expected. As 
discussed in Section 15.1.11 for the high stellar mass sample the clustering measurement is higher than that determined 
from the mock, although there are some individual survey mocks that do show this level of clustering. 

The errors estimated in this way treat each stellar mass limited sample as if it were independent of the others in 
our survey. As such we estimate the error one would expect if the correlation function was determined at a given 
stellar mass limit in a given redshift interval on any piece of sky. When comparing our stellar mass limited samples 
with each other they are not independent as they are measured in the same volume each time (the same piece of 
sky) . It is almost certainly the case that if in one particular volume the clustering amplitude of galaxies more massive 
than 1 X lO^°Af0 is higher than average the clustering amplitude of galaxies more massive than 5 x will also 

be higher. Our error estimates ignore this and we treat our measurements as if they had been made in independent 
volumes when testing for stellar mass dependent clustering. In this case we are most likely overestimating the error 
on any mass dependent clustering and underestimating its significance. As such we have estimated the error in a most 
conservative fashion and we still find significant stellar mass dependent clustering. We therefore choose not to pursue 
the much more complex simulations that would be required to fully model this effect. Of course our error estimates 
are perfectly correct if one wished to compare our measurements with similar ones estimated from another region of 
sky 



HALO MODEL 

Here we present a more detailed overview of our halo model calculation. 



The fraction of halos of mass M which host centrals is modeled as 

/ logM - logMr, 



{Nc\M) = i 



1 + erf 



C^logM 



(Bl) 



where erf is the error function M is the halo mass and Afmin and criog m parameterize the HOD. Only halos which 
host a central may host satellites. In such halos, the number of satellites is drawn from a Poisson distribution with 
mean 

(B2) 

Thus, the mean number of galaxies in halos of mass M is 

{N\M) = {N,\M)[1 + {N,\M)l (B3) 
and the predicted number density of galaxies is 



dMn{M){N\M), (B4) 

where n{M) is the halo mass function, for which we use the latest parameteriza tion given by iTinker et al.l (|2010bf ). 

We further assume that the satellite galaxies in a halo trace an NFW profile (jNavarro et al.lll996l ) around the halo 
center, and that the halos are biased tracers of the dark matter distribu tion. The halo bias (h( M)) depends on halo 
mass in a way that can be estim ated directly from the halo mass function (jSheth fc Tormenll 19990 . and we use the most 
up to date parameterization of 'Tin ker et al.l ()2010bf) . With these assumptions the halo model for ^(r) is completely 
specified (e.g. iCoorav fc Shetb,2002i ). We then calculate w{9) from ^(r) using equation (jS)). 
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In iWake et al.l ()2008al lb[) we used the linear theory power spectrum {PLin{k)) throughout the calculation, whereas 
we now use the non-linear power spectrum when calculating the 2-halo term. We also apply the scale dependent bias 
and halo exclusion 

In addition to £,{r), we are interested in the satellite fraction, 



Fsat = J dMn{M) {N,\M) {N,\M)/ng. (B5) 
and two measures of the typical masses of galaxy host halos: an effective halo mass 

Meff = J dM Mn{M) {N\M)/ng, (B6) 

and the average linear bias factor 

&g I dMn{M)b{M){N\M)/Tig, (B7) 



where b{M) is the halo bias. 

Our notation is intended to make explicit the fact that the mean number density of central-satellite pairs from such ha- 
los is n(M) {Nc\M) {Ns\M}, and the mean number density of distinct satellite-satellite pairs is n(M) {Nc\M) {Ns\M)'^/2 
(because we are assuming the satellite counts are Poisson). 

Our model for the real-space 2-point function is 



where 



({r) = l+ iclr) + 1 + („lr) + {,„(r) (B8) 

J rig UgM 

1+6.M=/^m "W™"' '"-I"'' ^'-1") (BIO, 

J rig 2 UgM^ 

f dk k'^ P2h{k) sinkr 

^^'^^'^^JT^^ k^ (^^'^ 

P2h{k,r)=hg{k,rf P{k), (B12) 
bg{k,r)= dM-^ — '-b{M,r){Nc\M) 1 + {Ns\M)u{k\M) . (B13) 



and 

with 
where 



In the expressions above, p(r|M) is the density profile of halos of mass M, X{r\M) denotes the convolution of two 
such profiles, u{k\M) is the Fourier transform of p{r\M)/M, and P{k) denotes the non-linear theo ry pow er spectrum 
at the redshift of interest. The scale dependent bias {b{M,r)) is calculated following [Tinker et al.l ()2005l) as 

where S,m{r) is the non- linear real-space matter 2pt correlation function. The upper limit to the integral, Mumir), as 
well as the use of the restricted number density, n'g in equatio n IB131 take into account the effect of halo exclusion, 
with Mum{r) determined following the method of iTinker et al.l (|2005f ) and 



MH„(r) 

dMn{M){N\M). (B15) 

All these quantities, along with the mass function n{M) and bias factor b{M,r), are to be evaluated at the redshift 
of interest. We have already specified how, for a given halo mass, the virial radius depe nds on redshift: the N FW halo 
density profile is also specified by its concentration, for which we assume the relation of iBullock et al.l (|2001| ). All this, 
in the right hand side of equation gives the halo model calculation of w{9). 
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